NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Improving Visualization Interpretation Using Counterfactuals

https://doi.org/10.1109/TVCG.2021.3114779

Kaul, Smiti; Borland, David; Cao, Nan; Gotz, David (January 2022, IEEE Transactions on Visualization and Computer Graphics)

Full Text Available
Graph Ranking Auditing: Problem Definition and Fast Solutions

https://doi.org/10.1109/TKDE.2020.2969415

Wang, Meijia; Kang, Jian; Cao, Nan; Xia, Yinglong; Fan, Wei; Tong, Hanghang (October 2021, IEEE Transactions on Knowledge and Data Engineering)
null (Ed.)
Full Text Available
Deep Co-Attention Network for Multi-View Subspace Learning

https://doi.org/10.1145/3442381.3449801

Zheng, Lecheng; Cheng, Yu; Yang, Hongxia; Cao, Nan; He, Jingrui (April 2021, The Web Conference 2021)
null (Ed.)
Full Text Available
ADMIRING: Adversarial Multi-network Mining

https://doi.org/10.1109/ICDM.2019.00201

Zhou, Qinghai; Li, Liangyue; Cao, Nan; Ying, Lei; Tong, Hanghang (November 2019, ICDM)

Multi-sourced networks naturally appear in many application domains, ranging from bioinformatics, social networks, neuroscience to management. Although state-of-the-art offers rich models and algorithms to find various patterns when input networks are given, it has largely remained nascent on how vulnerable the mining results are due to the adversarial attacks. In this paper, we address the problem of attacking multi-network mining through the way of deliberately perturbing the networks to alter the mining results. The key idea of the proposed method (ADMIRING) is effective influence functions on the Sylvester equation defined over the input networks, which plays a central and unifying role in various multi-network mining tasks. The proposed algorithms bear two main advantages, including (1) effectiveness, being able to accurately quantify the rate of change of the mining results in response to attacks; and (2) generality, being applicable to a variety of multi-network mining tasks ( e.g., graph kernel, network alignment, cross-network node similarity) with different attacking strategies (e.g., edge/node removal, attribute alteration).
more » « less
Full Text Available
Local Partition in Rich Graphs

https://doi.org/10.1109/BigData.2018.8622227

Freitas, Scott; Cao, Nan; Xia, Yinglong; Chau, Duen Horng; Tong, Hanghang (December 2018, 2018 IEEE International Conference on Big Data (Big Data))

Local graph partitioning is a key graph mining tool that allows researchers to identify small groups of interrelated nodes (e.g., people) and their connective edges (e.g., interactions). As local graph partitioning focuses primarily on the graph structure (vertices and edges), it often fails to consider the additional information contained in the attributes. We propose a scalable algorithm to improve local graph partitioning by taking into account both the graph structure and attributes. Experimental results show that our proposed AttriPart algorithm finds up to 1.6× denser local partitions, while running approximately 43× faster than traditional local partitioning techniques (PageRank-Nibble).
more » « less
Full Text Available
AURORA: Auditing PageRank on Large Graphs

https://doi.org/10.1109/BigData.2018.8622563

Kang, Jian; Wang, Meijia; Cao, Nan; Xia, Yinglong; Fan, Wei; Tong, Hanghang (December 2018, 2018 IEEE International Conference on Big Data (Big Data))

Ranking on large-scale graphs plays a fundamental role in many high-impact application domains, ranging from information retrieval, recommender systems, sports team management, biology to neuroscience and many more. PageRank, together with many of its random walk based variants, has become one of the most well-known and widely used algorithms, due to its mathematical elegance and the superior performance across a variety of application domains. Important as it might be, state-of-the-art lacks an intuitive way to explain the ranking results by PageRank (or its variants), e.g., why it thinks the returned top-k webpages are the most important ones in the entire graph; why it gives a higher rank to actor John than actor Smith in terms of their relevance w.r.t. a particular movie? In order to answer these questions, this paper proposes a paradigm shift for PageRank, from identifying which nodes are most important to understanding why the ranking algorithm gives a particular ranking result. We formally define the PageRank auditing problem, whose central idea is to identify a set of key graph elements (e.g., edges, nodes, subgraphs) with the highest influence on the ranking results. We formulate it as an opti-mization problem and propose a family of effective and scalable algorithms (Aurora) to solve it. Our algorithms measure the influence of graph elements and incrementally select influential elements w.r.t. their gradients over the ranking results. We perform extensive empirical evaluations on real-world datasets, which demonstrate that the proposed methods (Aurora) provide intuitive explanations with a linear scalability.
more » « less
Full Text Available
Rapid Analysis of Network Connectivity

https://doi.org/10.1145/3132847.3133170

Freitas, Scott; Tong, Hanghang; Cao, Nan; Xia, Yinglong (November 2017, ACM CIKM)

This research focuses on accelerating the computational time of two base network algorithms (k-simple shortest paths and minimum spanning tree for a subset of nodes)---cornerstones behind a variety of network connectivity mining tasks---with the goal of rapidly finding networkpathways andtrees using a set of user-specific query nodes. To facilitate this process we utilize: (1) multi-threaded algorithm variations, (2) network re-use for subsequent queries and (3) a novel algorithm, Key Neighboring Vertices (KNV), to reduce the network search space. The proposed KNV algorithm serves a dual purpose: (a) to reduce the computation time for algorithmic analysis and (b) to identify key vertices in the network (\textit ). Empirical results indicate this combination of techniques significantly improves the baseline performance of both algorithms. We have also developed a web platform utilizing the proposed network algorithms to enable researchers and practitioners to both visualize and interact with their datasets (PathFinder: http://www.path-finder.io.)
more » « less
Full Text Available
explaining team recommendation in networks

https://doi.org/10.1145/3240323.3241610

Zhou, Qinghai; Li, Liangyue; Cao, Nan; Buchler, Norbou; Tong, Hanghang (January 2018, RecSys '18 Proceedings of the 12th ACM Conference on Recommender Systems)

State-of-the-art in network science of teams offers effective recommendation methods to answer questions like who is the best replacement, what is the best team expansion strategy, but lacks intuitive ways to explain why the optimization algorithm gives the specific recommendation for a given team optimization scenario. To tackle this problem, we develop an interactive prototype system, Extra, as the first step towards addressing such a sense-making challenge, through the lens of the underlying network where teams embed, to explain the team recommendation results. The main advantages are (1) Algorithm efficacy: we propose an effective and fast algorithm to explain random walk graph kernel, the central technique for networked team recommendation; (2) Intuitive visual explanation: we present intuitive visual analysis of the recommendation results, which can help users better understand the rationality of the underlying team recommendation algorithm.
more » « less
Full Text Available
FIRST: Fast Interactive Attributed Subgraph Matching

https://doi.org/10.1145/3097983.3098040

Du, Boxin; Zhang, Si; Cao, Nan; Tong, Hanghang (August 2017, KDD)

Attributed subgraph matching is a powerful tool for explorative mining of large attributed networks. In many applications (e.g., network science of teams, intelligence analysis, finance informatics), the user might not know what exactly s/he is looking for, and thus require the user to constantly revise the initial query graph based on what s/he finds from the current matching results. A major bottleneck in such an interactive matching scenario is the efficiency, as simply rerunning the matching algorithm on the revised query graph is computationally prohibitive. In this paper, we propose a family of effective and efficient algorithms (FIRST) to support interactive attributed subgraph matching. There are two key ideas behind the proposed methods. The first is to recast the attributed subgraph matching problem as a cross-network node similarity problem, whose major computation lies in solving a Sylvester equation for the query graph and the underlying data graph. The second key idea is to explore the smoothness between the initial and revised queries, which allows us to solve the new/updated Sylvester equation incrementally, without re-solving it from scratch. Experimental results show that our method can achieve (1) up to 16x speed-up when applying on networks with 6M$$+$$ nodes; (2) preserving more than 90% accuracy compared with existing methods; and (3) scales linearly with respect to the size of the data graph.
more » « less
Full Text Available
Adaptive Contextualization Methods for Combating Selection Bias during High-Dimensional Visualization

https://doi.org/10.1145/3009973

Gotz, David; Sun, Shun; Cao, Nan; Kundu, Rita; Meyer, Anne-Marie (December 2017, ACM Transactions on Interactive Intelligent Systems)

Full Text Available

« Prev Next »

Search for: All records